Travelling Salesman Problem with a Center 
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We study a travelling salesman problem where the path is optimized with a cost function that 
includes its length L as well as a certain measure C of its distance from the geometrical center of 
the graph. Using simulated annealing (SA) we show that such a problem has a transition point that 
separates two phases differing in the scaling behaviour of L and C, in efficiency of SA, and in the 
shape of minimal paths. 



In the travelling salesman problem (TSP), which is per- 
haps the most famous combinatorial optimization prob- 
lem, one has to find the shortest path that joins a given 
set of N points. In addition to pure academic interest 
TSP appears naturally in some transportation applica- 
tions or production and testing of integrated circuits . 
This easy to formulate problem is however very difficult 
to solve. Actually TSP is known to be NP-complete Q 
and it is believed that there is no algorithm that can 
find a solution in time increasing as a finite order poly- 
nomial in N . Although advanced algorithms can find 
exact solutions of TSP for quite a large values of N, 
these algorithms are usually dedicated only to this par- 
ticular task. Consequently, they are not suitable for more 
general versions of TSP nor for some other numerically 
diffcult optimization problems. It is thus desirable to de- 
velop approximate but more versatile methods such as 
genetic programming 0L simulated annealing (SA) 0] or 
extremal optimization [5j, and for such a purpose TSP 
appears to be an excellent testing ground. 

Another NP-complete problem that is of considerable 
interest is the so-called satisfiability problem where one 
examines conditions needed to satisfy certain Boolean 
formulae [6j. Recently, it was shown that there is a 
phase transition in this class of problems that separates 
two regimes: easy and hard to satisfy. It turns out that 
such problems are most difficult to examine right at the 
transition point and computational complexity decreases 
when one moves away of the transition point A 
related phase transition was found in a certain version 
of TSP where distances between points are randomly 
drawn integer numbers {I,2,..fc}. In such a case the 
time needed to find the minimal solution using branch- 
and-bound algorithm dramatically increases when r ex- 
ceeds a certain threshold value |g. Methods used to es- 
tablish these results very often originate from statistical 
mechanics providing thus an interesting multidisciplinary 
bridge |2j .Although some other examples of connections 
between statistical mechanics and computational com- 
plexity were already examined |l0| . further explorations 
of this subject would be very desirable. 

In the present paper we examine a certain version of 
TSP where one requires the minimization of E = L + rC, 
where L is the total length of a path, C is the sum 



of distances of middle points of links to a geometrical 
center of the graph, and r is a control parameter of 
the model (of course, r — corresponds to the origi- 
nal TSP). Using simulated annealing we show that upon 
increasing r the model undergoes a phase transition from 
the L(-dominated) phase (small r) to the C(-dominated) 
phase (large r). In the i-phase we find L ~ N 1 / 2 and 
C ~ N, while in the C-phase a reverse scaling holds with 
L ~ N and C ~ A 1 / 2 . Moreover, minimal paths in these 
two phases have qualitatively different shape. What is 
also interesting is the change of efficiency of SA. The in- 
dependent optimization problem that we study does not 
seem to be easier than the r = case, and we expect that 
for any r our problem is also NP-complete. Neverthe- 
less, in the C-phase SA has a much better efficiency than 
in the L-phase at least with respect to finding nearly- 
minimal paths. Such an effect might be related with a 
change of the energy landscape of the problem, but fur- 
ther studies would be needed to verify such a claim. 

To define our problem, let us consider N points dis- 
tributed in a unit square of a Euclidean plane. Denoting 
coordinates of i-th point as we have 

N 



L =J2y( x iH - x lk+1 ) 2 + {y lk -y 4fc+1 ) 2 , (1) 
fc=i 

^ N 

v ^ + Xi *+* - 1)2 + + Vi *+i - 1)2 ' (2) 

i=l 

where we assume that in a given path, points appear in 
the order . . . , «jV)*i and in+i = *i- Next, we intro- 

duce the cost function E = L + rC. We cannot provide 
an immediate application of such a problem, but one can 
imagine that in some transportation tasks staying dur- 
ing the tour close to (or far from ) the center might be 
of some importance. Such an additional condition might 
be particularly relevant in war areas where staying near 
a military base is important for safety reason. Another 
application might be a "just-in-time vehicle routing prob- 
lem" , in which truck drivers might be called to the center 
depot intermediately in order to get further goods that 
have to be delivered to the customers. It is easy to real- 
ize that minimization of L usually does not minimize C 
and competition of these two terms might lead to some 
interesting effects. 
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FIG. 1: The length L of the minimal path as a function of r 
for N = 100. The left inset shows square root of its variance 
<Ja as a function of r. The right inset shows the total distance 
from the center C as a function of r. 
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FIG. 2: Typical shapes of nearly minimal paths for (from left 
to right) r — 0.0, 2.0 and 2.5. Calculations were made for 
N = 100 and c = 10" 6 . 



To find a path that minimizes E we use a standard 
simulated annealing technique. For a given configura- 
tion, we produce a trial configuration by exchanging ran- 
domly positions of two points. Such a move is accepted 
with a probability min(l, e~ SE / T ) where 8E is the en- 
ergy difference between a trial and initial configuration, 
and T is a fictitious temperature. During simulations, 
the temperature T is reduced to zero linearly in time t 
according to T(t) = 1 — ct where c is the cooling rate and 
a unit of time is defined as an attempt to make N moves. 
For a small cooling rate c such a method finds a path 
of a low (and for small N perhaps even lowest) value of 
E. To check that the properties of our model are not the 
consequence of the numerical method, we did additional 
runs using the so-called Lin-2-Opt moves that cut a con- 
figuration in two pieces, turn the direction of one piece 
around, and reconnect . Such moves are known to be 
more effective than exchanges of two points 12]. Both 
methods, however, yields essentially the same behaviour 
of our model with respect to e.g., location of the transi- 
tion point or A^-dependence of L and C. Unless specified 
otherwise, described below results are obtained with the 
use of the exchange algorithm. 

In Fig. [I]we show the length L of a minimal path as 
a function of r averaged over 1000 distributions of N = 
100 points. One can see that L is a slowly increasing 
function of r except around r = 2.0 L where the increase 
is more abrupt. Around the same value of r there is the 
maximum of the variance of L and the abrupt change of 
C. These results suggest that a certain phase transition 
takes place around r = 2. 

A visual indication of a qualitative change around 
r ~ 2.0 is shown in Fig. For large r (C-dominated 
phase), paths that minimize E have large length L but 
many intersections around the center (1/2,1/2) yield a 



small value of C. For small r (L-dominated phase) typ- 
ical minimal paths have a much different shape. They 
have only few intersections and are more or less uniformly 
distributed in a unit square. 

That in our problem there are two phases separated by 
a transition point around ~ 2.0 is also seen in Fig.[3]which 
presents the Af-dependence of L and C. These results 
show that in the L-phase (r = 1.9) L ~ N 1 / 2 and C ~ N, 
Consequently, the cost function E is dominated by the 
distance C. Let us notice that the scaling L ~ A -1 / 2 
was already proven for the original TSP (r = 0) [l3| . 
As for the second relation (C ~ N), there is a simple 
argument that justifies it. Indeed, since in the L-phase 
a minimal path is relatively uniformly distributed in the 
unit square one can assume that central points of links are 
also almost uniformly distributed. It means that there 
is a positive (and ^-independent in the limit A~ — > oo) 
average distance of a central point from (1/2,1/2) and 
then C ~ N easily follows . 

In the C-phase we find L ~ N and C ~ N 1 / 2 . Such 
relations can be deduced from the structure of a typ- 
ical minimal path as seen in Fig. □ In the C-phase 
points that are approximately opposite with respect to 
(1/2,1/2) are pairwise linked. It means that there is a 
finite (A^-independent) length of such a link and thus 
L ~ A" follows. The relation C ~ \/N is more subtle 
and we can justify it only approximately. In the fol- 
lowing we argue that for a minimal path a typical dis- 
tance c of a center of a link to (1/2,1/2) scales as 1/yN 
from which the relation C ~ y/~N easily follows. To show 
that c ~ 1/VN let us consider e.g., a point (x\,yi) and 
ask what is a minimum c m of distances of a center of 
a link between (x±, y\) and any of the remaining A^ — 1 
points. It means that we have to find a minimum of 
\^J{xi +x % ~ l) 2 + (yi + Vi - l) 2 where i = 2,3...,N. 
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FIG. 3: Size dependence of L and C for r = 1.9, 2.0 and 2.5. 
Calculations made for c = 10~ 4 , 10~ 5 and 10~ 6 were extrap- 
olated to c = and average was made over 1000 independent 
samples. Continuous and dotted lines have slope 1.0 and 0.5, 
respectively. 



FIG. 5: The minimal distance c m from a center of a link to 
(1/2,1/2) as a function of 1/y/N. Links are made between 
a randomly chosen point (xi,yi) and N — 1 other randomly 
chosen points. Presented results are averages over 100 choices 
of (xi,yi). 
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FIG. 4: A link with a point inside A circle has a center inside 
B circle with a twice smaller radius. 

Let us notice that in a (two-dimensional) unit square 
N — 1 points set a characteristic distance between points 
as 1 / y/~N. Thus one can expect that in a circle of a ra- 
dius 1/y/N that is opposite (with respect to (1/2,1/2)) 
to the point (xi,yi) there is approximately one of these 
N — 1 points (see Fig.0J). A link with such a point has a 
center that is inside a circle around (1/2,1/2) and has a 
twice smaller radius (but still it is of the order of 1/y/N). 
Consequently, the smallest distance c m of a center of link 
to the point (1/2,1/2) should scale as 1/y/N and this 
argument is supported by simple numerical calculations 
that we present in Fig. 03 We expect that in the mini- 
mal path c also scales cLS Cm 1 ■ G . , clS 1/y/N and thus the 
relation C ~ y/~N follows. Let us notice that scaling of 
L and C in the C-phase is analogous to the scaling in 



the L-phase but with the role of L and C interchanged. 
We do not know whether this relation is accidental or in- 
dicates a deeper relation (e.g., certain duality) between 
these two phases. 

As it might be expected, the scaling of L and C at the 
transition point r = 2.0 is different than in each of the 
phases. Figure [3] shows that in this case both quantities 
increase linearly with N. 

Finally, we compare the efficiency of the simulated an- 
nealing in each phase. Our data (Fig. [fJJ show that in 
the C-phase SA is much more efficient and already with 
a relatively fast cooling a (nearly) minimal path is found. 
In the L=phase corrections due to the nonzero cooling- 
rate are much more important (our simulations show that 
these corrections in the L-phase decay approximately as 
c 1 / 3 ). For r = our data extrapolated to the limit of 
zero cooling rate give for N = 100 the average length of 
a minimal path as Lq = 7.8, which is in a good agree- 
ment with other results quoted in the literature [l5| . For 
comparison, we show also the results obtained using the 
Lin-2-Opt algorithm |ll| . This time simulations were 
made for N = 200 and the exponential cooling schedule 
was used T(t) = Tq(1 — c)*, whose effectiveness was al- 
ready demonstrated Results of these simulations, 
shown in Fig.0 confirm the change of efficiency of SA at 
the transition point of our model. 

A main difficulty with finding a minimal path in TSP 
is due to the complexity of the cost function and more 
specifically due to its many local minima. Some of these 
minima, being quite far from the global minimum, might 
be at the same time quite deep and the searching algo- 
rithm might get stuck in one of them. The landscape of 
the cost function in our problem is also very complex and 
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15 . _ of simulated annealing method, a phase transition in this 
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FIG. 7: The average normalized length L/Lo as a function of 
exponential cooling rate c, where Lo is the extrapolated value 
of L at the zero cooling rate (N = 200). Simulations were 
made using Lin-2-Opt moves. Let us notice that L converges 
to Lo linearly in c 1 ^ 2 . For the algorithm with exchange moves 
(Fig.||jJ convergence is slower and linear in c 1//3 . 



has many minima. However, a substantial increase in ef- 
ficiency of SA in the C-phase indicates a certain change 
in this landscape. One possibility is that in the C-phase 
local minima are located mainly in the vicinity of the 
global minimum. As a result, a solution found using SA, 
which is almost always only a nearly optimal path, is a 
good approximation of the optimal solution, In our opin- 
ion, such a transition for a model that most likely is NP- 
complete for any r is new and it would be desirable to 
have its better understanding. Another type of a tran- 
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